1,293 research outputs found
Discovering Organizational Correlations from Twitter
Organizational relationships are usually very complex in real life. It is
difficult or impossible to directly measure such correlations among different
organizations, because important information is usually not publicly available
(e.g., the correlations of terrorist organizations). Nowadays, an increasing
amount of organizational information can be posted online by individuals and
spread instantly through Twitter. Such information can be crucial for detecting
organizational correlations. In this paper, we study the problem of discovering
correlations among organizations from Twitter. Mining organizational
correlations is a very challenging task due to the following reasons: a) Data
in Twitter occurs as large volumes of mixed information. The most relevant
information about organizations is often buried. Thus, the organizational
correlations can be scattered in multiple places, represented by different
forms; b) Making use of information from Twitter collectively and judiciously
is difficult because of the multiple representations of organizational
correlations that are extracted. In order to address these issues, we propose
multi-CG (multiple Correlation Graphs based model), an unsupervised framework
that can learn a consensus of correlations among organizations based on
multiple representations extracted from Twitter, which is more accurate and
robust than correlations based on a single representation. Empirical study
shows that the consensus graph extracted from Twitter can capture the
organizational correlations effectively.Comment: 11 pages, 4 figure
Scale-Adaptive Group Optimization for Social Activity Planning
Studies have shown that each person is more inclined to enjoy a group
activity when 1) she is interested in the activity, and 2) many friends with
the same interest join it as well. Nevertheless, even with the interest and
social tightness information available in online social networks, nowadays many
social group activities still need to be coordinated manually. In this paper,
therefore, we first formulate a new problem, named Participant Selection for
Group Activity (PSGA), to decide the group size and select proper participants
so that the sum of personal interests and social tightness of the participants
in the group is maximized, while the activity cost is also carefully examined.
To solve the problem, we design a new randomized algorithm, named Budget-Aware
Randomized Group Selection (BARGS), to optimally allocate the computation
budgets for effective selection of the group size and participants, and we
prove that BARGS can acquire the solution with a guaranteed performance bound.
The proposed algorithm was implemented in Facebook, and experimental results
demonstrate that social groups generated by the proposed algorithm
significantly outperform the baseline solutions.Comment: 20 pages. arXiv admin note: substantial text overlap with
arXiv:1305.150
When Social Influence Meets Item Inference
Research issues and data mining techniques for product recommendation and
viral marketing have been widely studied. Existing works on seed selection in
social networks do not take into account the effect of product recommendations
in e-commerce stores. In this paper, we investigate the seed selection problem
for viral marketing that considers both effects of social influence and item
inference (for product recommendation). We develop a new model, Social Item
Graph (SIG), that captures both effects in form of hyperedges. Accordingly, we
formulate a seed selection problem, called Social Item Maximization Problem
(SIMP), and prove the hardness of SIMP. We design an efficient algorithm with
performance guarantee, called Hyperedge-Aware Greedy (HAG), for SIMP and
develop a new index structure, called SIG-index, to accelerate the computation
of diffusion process in HAG. Moreover, to construct realistic SIG models for
SIMP, we develop a statistical inference based framework to learn the weights
of hyperedges from data. Finally, we perform a comprehensive evaluation on our
proposals with various baselines. Experimental result validates our ideas and
demonstrates the effectiveness and efficiency of the proposed model and
algorithms over baselines.Comment: 12 page
Shilling Black-box Review-based Recommender Systems through Fake Review Generation
Review-Based Recommender Systems (RBRS) have attracted increasing research
interest due to their ability to alleviate well-known cold-start problems. RBRS
utilizes reviews to construct the user and items representations. However, in
this paper, we argue that such a reliance on reviews may instead expose systems
to the risk of being shilled. To explore this possibility, in this paper, we
propose the first generation-based model for shilling attacks against RBRSs.
Specifically, we learn a fake review generator through reinforcement learning,
which maliciously promotes items by forcing prediction shifts after adding
generated reviews to the system. By introducing the auxiliary rewards to
increase text fluency and diversity with the aid of pre-trained language models
and aspect predictors, the generated reviews can be effective for shilling with
high fidelity. Experimental results demonstrate that the proposed framework can
successfully attack three different kinds of RBRSs on the Amazon corpus with
three domains and Yelp corpus. Furthermore, human studies also show that the
generated reviews are fluent and informative. Finally, equipped with Attack
Review Generators (ARGs), RBRSs with adversarial training are much more robust
to malicious reviews
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Large Pre-trained Transformers exhibit an intriguing capacity for in-context
learning. Without gradient updates, these models can rapidly construct new
predictors from demonstrations presented in the inputs. Recent works promote
this ability in the vision-language domain by incorporating visual information
into large language models that can already make in-context predictions.
However, these methods could inherit issues in the language domain, such as
template sensitivity and hallucination. Also, the scale of these language
models raises a significant demand for computations, making learning and
operating these models resource-intensive. To this end, we raise a question:
``How can we enable in-context learning without relying on the intrinsic
in-context ability of large language models?". To answer it, we propose a
succinct and general framework, Self-supervised IN-Context learning (SINC),
that introduces a meta-model to learn on self-supervised prompts consisting of
tailored demonstrations. The learned models can be transferred to downstream
tasks for making in-context predictions on-the-fly. Extensive experiments show
that SINC outperforms gradient-based methods in various vision-language tasks
under few-shot settings. Furthermore, the designs of SINC help us investigate
the benefits of in-context learning across different tasks, and the analysis
further reveals the essential components for the emergence of in-context
learning in the vision-language domain.Comment: Accepted by ICCV 2023; Camera Ready Versio
- …